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relative onset time of the first formant and which are characteristically associated 
with the phonemes /do/ and /to/ were presented to two groups of subjects or 
labelling. The stimuli presented to each group were identical except for a one-step 
shift on the stimulus continuum. It was found that the response probabilities over the 
stimuli common to the two stimulus sets were essentially identical for the two groups 
Subjects did not evidence a shift in labelling judgments con*.-, mitant with a shift on the 
stimulus continuum. Such a shift would occur if subjects were simply partitioning the 
continuum psychophysically. It is concluded that categorical perception was observed 
and that the stimulus-shift paradigm may be a new technique for isolating the 
categorical perception effect. (Author /DO) 
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Synthetic speech stimuli which varied along a continuum 
defined in terms of the relative onset time of the first for- 
mant and which are characteristically associated with the 
phonemes /do/ and /to/ were presented to 2 groups of subjects 
for labelling. The stimuli presented to each group were 
identical except for a one-step shift on the stimulus con- 
tinuum. It was found that the response probabilities over 
the stimuli common to the 2 stimulus sets were essentially 
identical for the 2 groups. Subjects did not evidence a 
shift in labelling judgments concomitant with a shift on the 
stimulus continuum. Such a shift would occur if subjects were 
simply partitioning the continuum psychophysically. It is 
concluded that categorical perception was observed and that 
the stimulus-shift paradigm may be a new technique for isolat- 
ing the categorical perception effect. 



It has been shown recently that the perception of certain 
language sounds is of a peculiarly categorical nature. Identifica- 
tion data collected from human subjects for synthesized speech 
sounds over a variety of phonemic contrasts reveal that perception 
is discontinuous, i.e., the stimuli are sorted into one of two 
categories on pretty much an all-or-none basis (Liberman et al. , 
19575 1961.) The data taken to define categorical perception are 
in the form of mutually overlapping f requency-of-identification 
gradients which resemble complementary step functions. In addition 
to the discontinuous nature of these identification functions, it 
has been shown that: a) there is an abrupt increase in latency 
of the identification responses at the point where the two cate- 
gories adjoin, b) the topographies of the verbal identification 
responses tend to be discrete— are in themselves categorical, and 
c) stimulus discriminability on a pair-comparison basis is better 
at the category boundary than any any point within the category 
(see Figure 1) . Liberman has used such data to construct a theory 
which accounts for the categorical quality of speech perception 
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terms of the discreteness of the articulatory movements necessary 
for the production of certain speech sounds (for a review of the 
data and the theory see Lane, 1965). 



Insert Figure 1 About Here 



Others (Cross & Lane, 1962; Cross, Lane, & Sheppard, 1965; 

Lane, 1965) have observed that the paradigm for categorical 
perception is essentially a generalization paradigm in which two 
stimuli on a given continuum exercise a great deal of control over 
two incompatible responses. When this notion is tested by present- 
ing stimuli intermediate to two well conditioned controlling stimuli, 
irrespective of the continuum, the finding is that the probability 
of each response is high and constant over the stimulus range immedi- 
ate to its respective controlling stimulus, with an abrupt decrement 
to zero over the stimulus range immediate to the stimulus which 
controls the other response. Generalization gradients obtained in 
this way resemble the complementary step functions obtained for the 
identification of synthetic speech stimuli. In these other studies, 
the collateral data taken by Liberman to define categorical percep- 
tion-latency peaks, discrimination enhancement, and discreteness 
of response topography— —have also been shown to occur. These facts 
have led to the conclusion that categorical perception is not unique 
to speech stimuli and that an extensive discrimination history in- 
volving two mutually incompatible responses is paradigmatic of cat- 
egorical perception (Lane, 1967). 

Categorical perception is of interest to a student of psycho- 
physics: although it resembles simple stimulus equisection, it has 
additional properties. Of special significance is the enhancement 
of discrimination at category boundaries. This discontinuity is 
not characteristic of discrimination with such stimulus attributes 
as amplitude, duration, or frequency. In tact, the relative 
monotonicity of changes in discrimination probabilities on these 
latter continue, is the basis for the construction of Fechnerian 
scales for the growth of sensation. 
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i;ow, another property of equisection is the independence of 
equisection judgments from the absolute values of the stimuli to be 
categorized. Bisection of a 10 db interval of sound intensity will 
be at a point about 6 db above the lower intensity , whether the 
values to be bisected are 60 to 70 db, 65 to 75 db, or 70 to 80 db. 
This observation leads to an interesting quest? What are the 
properties of equisection on linguistic continua? Will the bi- 
section function for synthetic speech stimuli ranging over some 
distance, x, on a continuum be the same as or different from the 
bisection function for stimuli ranging over the same distance 
but at a slightly different locus on the continuum? In other words, 
will the positioning of the perceptual midpoint on a synthetic 
speech continuum be independent of the absolute stimulus values, as 
it would be in the case of simple bisection, or will it stay rela- 
tively fixed and thus reflect the categorical nature of the per- 
ception of phonemes? 



Method 

Synthetic speech stimuli which are typically identified as 
continuous with respect to the two English phonemes /do/ and /to/ 
were presented to two groups of subjects for identification. 

Subjects served individually in sessions lasting approximately 30 
minutes. The stimuli were synthesized at Haskins Laboratories on 
a device known as the Pattern Playback (Liberman et al. , 1961) • 
Each jroup was presented with one of two sets of stimuli. The sets 
were identical except for a one-step shift of the relative onset 
time of the energy in the first and second formants of the synthetic 
speech signal. There were six stimuli in each stimulus set. Each 
stimulus was presented an equal number of times in a triad, sand- 
wiched, as it were, between the two stimuli defining the range of 

% 

the stimulus set. The subjects rated each stimulus as identical 
to either the first or third member of the triad. Instructions 

to the subject were as follows: 

In this experiment you will judge language sounds. 

You will hear two kinds of sounds — /do/ and /to/. They 
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will be presented to you in triads. In each triad, the 
first stimulus will always be /do/ and the third stimul- 
us will always be /to/. The middle stimulus will be 
either /do/ or /to/. After each triad depress the left 
hand switch if you think the middle stimulus is more 
like the first stimulus (/do/) or the right hand switch 
if you think the middle stimulus is more like the third 
stimulus (/to/). Be sure to respond each time but make 
only one response after each triad. Do not try to make 
an equal number of responses on each switch as / do/ and 
/to/ do not necessarily appear with equal frequency in 
the middle position. Have you any questions? 

The tape recorded stimuli were presented in random order to 
the subject througl a set of audiometer headphones (Sharpe, Type A, 
model HA-10) . Responses were r corded automatically on a multiple 
channel event recorder when the subject depressed one of the two 
switches to indicate his response. Each stimulus triad was pre- 
sented 10 times. There were 10 Ss in each group. 



Results and Discussion 

From Figure 2 it can be seen that the two group identification 
functions, when plotted over the range of stimuli used in the total 
study, are essentially congruent; response probabilities are about 
the same for both groups <»t any given absolute cutback value. The 
difference between the intersection points for the two sets of data 
is on the order of 1 msec. , a difference not at all close to the 
10 msec, shift by which the two stimulus sets differed. So close 
a matching would not be expected if the stimuli had been tone 
intensities, lengths of lines, etc. 



Insert Figure 2 About Here 



The abrupt disparity observed at one point on the continuum 
might be interpreted as indicative of a shift in the identification 
function in accord with the shift in the stimulus set. This inter- 
pretation is weakened, however, when the data from only the most 
reliable subjects, those subjects for whom there were no abrupt 
discontinuities in the identification function, are analyzed (see 
Figure 3). These subjects yielded gradients which tended to be 
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superimposed even more than those of the groups as a whole. Yet 
one would expect any shift due to context to be most readily 
apparent with the reliable subjects, since it seems reasonable to 
assume that their behavior was most under stimulus control. 



Insert Figure 3 About Here 



A more likely explanation for the slight disparity between the 
two gradients is in terms of response alternation. Assuming chat 
stimulus control dictated that the response probabilities be 
identical for the two groups, then one of the groups would find 
itself emitting many more of one kind of response than of the 
other. The tendency of this group would be then occasionally to 
emit a few responses of the second kind, irrespective of which 
st'* .ftulus was presented — a tendency to balance things up, so to 
speak. The instructions were worded so as to minimize such a 
possibility, but evidently to no avail. 

Conclusions 

The present study suggests that a test using identification 
probabilities and a shift in stimulus context might point to differ- 
ences in categorical responding in situations involving speech 
stimuli as against situations in which some simple stimulus con- 
tinuum is employed. The data obtained show that identification 
functions from two groups of subjects for two overlapping sets of 
synthetic speech stimuli tend to be superimposed, although the 
effect was contaminated by a tendency toward response alternation 
on the part of the subjects. A study is now underway with the 
stimulus triads arranged in both ascending and descending order. 
Eliminating the tendency to alternation should lead to better 
congruence of the two functions. 
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Footnotes 

■^The research reported herein was performed in part pursuant 
to Contract OEC-3-6-061784-0508 with the U. S. Department of 
Health, Education, and Welfare, Office of Education, under the 
provisions of P. L. 83-531, Cooperative Research, and the provisions 
of Title VII, P. L. 85-864, as amended. This research report is one 
of several which have been submitted to the Office of Education as 
Studies in Language and Language Behavior. Progress Report VII. , 

September 1, 1968. 

o 

Now at Claremont Graduate School. 

3 An abrupt discontinuity was defined as any non-monotonic 
change in the identification function in the region of transition 
from 100% to 0% response probabilities. 



Figure Captions 

Figure 1. Observed distributions of discrimination accuracy 
and identification probability and latency for the /do/-/to/ 
continuum. From Lane (1967) . 

Figure 2. Identification functions for two groups of Ss 
for two overlapping sets of synthetic speech stimuli. Each point 
is the arithmetic mean percentage across all Ss for /do/ responses 
only. The /to/ response percentages are complementary. 

Figure 3. Identification functions for the best discriminating 
Ss in each of two groups of Ss for two overlapping sets of synthetic 
speech stimuli. The criterion for ’’best discriminating" was a 
lack of any abrupt discontinuity in a S.’ s identification function. 
Each point is the arithmetic mean percentage across all selected 
Ss for /do/ responses only. The /to/ response percentages are 
complementary . 
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